Corpus: ckb_wikipedia_2021_10K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 43 47 80 80 85
1000 670 850 959 962 980
10000 5262 8951 9600 9673 9777
100000 5262 8952 9601 9674 9778
1000000 5262 8952 9601 9674 9778


Zipf's diagram for sentence endings


Gnuplot diagram

1479 msec needed at 2024-10-02 13:06